Language-Independent Named Entity Recognition (II)
The training, development and test data sets for English and German, as well as the evaluation software for this shared task, are available in one gzipped tar file.
BUILDING THE TRAIN AND TEST DATA FILES
bin/make.eng
This will generate the training data (either eng.train or deu.train), the development test data (eng.testa or deu.testa) and the final test data (eng.testb or deu.testb) in the ner directory.
code:steps.sh
tar zxf ner.tgz
cd ner
bin/make.eng requires the Reuters data CD to be mounted.
bin/make.eng.2016 requires rcv1.tar.xz (the Reuters data).
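The two build scripts need different Reuters sources, so it can help to check which one applies before running anything, and to confirm afterwards that the three English files were produced. The following is a minimal sketch, not part of the shared-task distribution: the helper names `choose_make_script` and `missing_outputs` are hypothetical, and it assumes rcv1.tar.xz has been placed directly in the ner directory, which these notes do not specify.

```shell
#!/bin/sh
# Hypothetical helpers; not shipped with ner.tgz.

# Print the build script to use for the English data.
# Assumption: rcv1.tar.xz, if available, sits directly in the ner directory.
# Otherwise fall back to bin/make.eng, which needs the Reuters CD mounted.
choose_make_script() {
    ner_dir="$1"
    if [ -f "$ner_dir/rcv1.tar.xz" ]; then
        echo "bin/make.eng.2016"
    else
        echo "bin/make.eng"
    fi
}

# Print the name of each expected English output file that is missing
# from the ner directory. Prints nothing when all three are present.
missing_outputs() {
    ner_dir="$1"
    for f in eng.train eng.testa eng.testb; do
        [ -f "$ner_dir/$f" ] || echo "$f"
    done
}
```

After sourcing the file from inside the ner directory, `choose_make_script .` prints the script name to run, and `missing_outputs .` lists any of eng.train, eng.testa, or eng.testb that the build did not generate.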